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Preface 


The  reader  who  is  not  familiar  with  Pascal’s  triangle  should  be  warned 
that  it  is  not  a  geometric  triangle  with  three  angles  and  three  sides.  What 
we  call  Pascal’s  triangle  is  an  important  numerical  table,  with  the  help  of 
which  a  number  of  computation  problems  may  be  solved.  We  shall 
examine  some  of  these  problems  and  shall  incidentally  touch  upon  the 
question  of  what  “solving  a  problem”  can  mean  in  general. 

This  exposition  requires  no  preliminary  knowledge  beyond  the  limits 
of  the  eighth-grade  curriculum,  except  for  the  definition  of  and  notation 
for  the  zeroth  power  of  a  number.  That  is,  one  must  know  that  any  non¬ 
zero  number,  raised  to  the  zeroth  power,  is  considered  (by  definition !) 
to  be  equal  to  unity:  a0  =  1  for  a  ^  0. 
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A  Problem 
from  the 

Eighth  Olympiad 


Duringthe  Eighth  Moscow  Mathematical  Olympiad  (1945),  the  follow¬ 
ing  problem  was  presented  to  the  ninth-  and  tenth-grade  participants: 1 

A  network  of  roads  is  given  (fig.  1.1).  From  point  A,  21000  men  set  out. 
Half  go  in  direction  /,  half  in  direction  m.  Reaching  the  first  intersection, 

each  group  divides;  half  go  in 
direction  /,  and  half  in  direction  m. 
Such  a  division  takes  place  at  each 
intersection.  How  many  people 
arrive  at  each  intersection  of  the 
1000th  row?2 

First  let  us  observe  that  at  the 
moment,  we  do  not  know  whether 
the  problem  has  a  solution ;  that  is, 
whether  the  division  of  people  can 
proceed  as  required  by  the  prob¬ 
lem’s  conditions.  We  know  that  if 
an  odd  number  of  people  arrive  at 
some  intersection  at  which  the  usual  division  of  the  stream  of  people  is  to 
take  place,  then  the  division  is  blocked.  Consequently,  for  the  problem 
to  have  a  solution,  it  is  necessary  and  sufficient  that  an  even  number  of 
people  arrive  at  each  intersection  of  each  of  the  first  thousand  rows, 
from  the  zeroth  to  the  nine  hundred  ninety-ninth.  We  must  make  certain 
that  this  is  so  in  the  process  of  solving  the  problem. 

Let  us  begin  by  introducing  symbols  for  the  number  of  people  who  pass 


1.  See  A.  M.  Yaglom  and  I.  M.  Yaglom,  Challenging  Mathematical  Problems 
with  Elementary  Solutions  (San  Francisco :  Holden-Day,  1964),  1 :19,  problem  62b. 

2.  Consider  the  rows  to  be  numbered,  starting  with  the  zeroth.  Thus,  in  the 
zeroth  row,  there  is  one  intersection  (A);  in  the  first,  two;  in  the  second,  three; 
and  so  on. 
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through  each  intersection  of  our  network  of  roads.  The  intersections  of 
each  row  will  be  numbered  from  left  to  right  beginning  with  the  zeroth ; 
consequently,  the  intersections  of  the  nth  row  will  be  numbered  from  zero 
to  n.  The  number  of  people  who  pass  through  the  kth  intersection  of  the 
nth  row  will  be  denoted  by  H\.  Since  it  is  not  clear  at  the  present  time 
that  the  problem  has  a  solution,  we  cannot  be  certain  that  all  numbers 
Hnk  exist;  that  is,  the  number  Hnk  exists  for  each  k  from  0  to  n  and  for 
every  n  from  0  to  1000.  It  is,  however,  clear  that  some  of  these  numbers 
exist.  By  virtue  of  the  notation  we  have  introduced, 

H°0  =  21000 .  (1.1) 

Let  us  now  determine  how  the  numbers  H  nk  (k  =  0,  1,  2, . . . ,  n)  and 
Hn  +  1k  (k  =  0,  1,  2, . . .,  n  +  1)  are  related,  under  the  supposition  that 
they  all  exist.  We  shall  show  that  if  all  the  numbers  H\  exist  and  are 
even,  then  all  the  numbers  Hn  +  1k  exist.  Let  us  examine  the  nth  and 
(n  +  l)th  rows  of  intersections  and  the  road  segments  which  connect 
them.  At  each  intersection  we  place  the  appropriate  symbol  for  the  num¬ 
ber  of  people  arriving  (see  fig.  1.2).  The  number  of  people  who  enter  the 


Fig.  1.2 


zeroth  intersection  of  the  nth  row  (that  is,  Hn0 )  is  divided  by  two,  and 
only  half  of  these  people  enter  the  zeroth  intersection  of  the  (n  +  I)st 
row;  therefore, 


#  n  +  1o  =  ^y2  '  (1.2) 

The  other  half  of  the  Hn0  people  enter  the  first  intersection  of  the 
(n  +  l)th  row  and  there  join  half  the  people  who  left  the  first  intersection 
of  the  nth  row,  who  number  Hnr\ 2. 

Therefore,  Hn  +  11  =  ( Hn0  +  Hn1)l2.  In  general,  the  number  of  people 
who  enter  the  kth  intersection  of  the  (n  +  l)th  row  is  the  sum  of  half  the 
number  of  people  who  left  the  ( k  —  l)th  intersection  of  the  nth  row 
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(or  Hnk-l/2),  and  half  the  number  of  people  who  left  the  &th  inter¬ 
section  of  the  nth  row  (or  Hn, c/2).  Thus, 

Hn+1k  =  ~n*-d-±  -Js  for  1  <  k  <  n  .  (1.3) 

Finally,  the  number  of  people  who  enter  the  (n  +  l)th  intersection 
of  the  ( n  +  l)th  row  is  equal  to  half  the  number  of  people  who  left  the 
nth  intersection  of  the  nth  row : 

Hn+\+ 1  =  ■  (1.4) 

The  relations  (1 . 1)— (1 .4)  allow  us  to  establish  the  fact  that  the  problem 
has  a  solution.  Actually,  from  equations  (1.2)— (1 .4)  it  follows  that  if  for 
any  fixed  n  all  numbers  of  the  nth  row  ( Hn0 ,  Hn x, . . . ,  Hnn)  exist  and  are 
divisible  by  2a,  then  all  numbers  of  the  (n  +  l)th  row  (//n  +  10,  i/n  +  l1; 
. . .,  Hn  +  1n+1)  exist  and  are  divisible  by  a.  For  if  we  suppose  that 
Hn0,Hn1,...,H\  exist  and  are  all  divisible  by  2a,  then  there  are 
integers  (whole  numbers)  Mn0,  Mnu  . . .,  M  nn  satisfying  the  relations 

Hn  o  =  2aMn0 
H\  =  2  aM\ 

Hnn  =  2aMnn . 

Thus,  we  have  (using  (1.2)— (1.4)): 

Hn+\  =  ^Y  =  aMn  0; 

„n  +  1  Hnk.1  +  H\  2  flA/Vr  +  2aMnk 

k - 2  2 

=  a(Mnk-1  +  M nk)  for  1  <  k  <  n ; 

Hn+\+1  =  Zf  =  aMnn. 

This  establishes  the  claim  that  the  numbers  Hn+10,  Hn  +  11 . Hn+1n+1 

exist  and  are  all  divisible  by  a. 
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Therefore,  since  all  numbers  of  the  zeroth  row  (there  is  only  one, 
H°0)  exist  and  are  divisible  by  21000  (by  1.1),  we  have  verified  that  all 
numbers  of  the  first  row, 


H° x,  H\  , 

exist  and  are  divisible  by  2889;  all  numbers  of  the  second  row. 


H\,  H\,  H\ , 

exist  and  are  divisible  by  2888;  and  so  on,  until  all  numbers  of  the  999th 
row, 


1/999  77999  77999 

n  Oj  ”  19  •  •  •>  n  999  J 

exist  and  are  divisible  by  2;  and  all  numbers  of  the  1000th  row, 

77IOOO  7/1000  7/1000 

11  Oj  -*-*  !»•••»■«  1000  j 


exist  (and  are  divisible  by  1). 

The  relations  (1.2)— (1.4)  not  only  show  that  the  problem  has  a  solu¬ 
tion,  but  also  provide  a  method  for  calculating  the  line  of  numbers 

fJn  +  l  zi  n  +  1  I/n  +  l 

n  0)  n  It  ■  ■  •>  n  n+1 

from  the  line 

Repeatedly  applying  these  relations,  beginning  with  the  zeroth  line  (by 
using  (1.1)),  we  theoretically  can  calculate  the  numbers  H\  for  all 
501,501  intersections  in  all  the  rows  through  the  1000th  and,  in  par¬ 
ticular,  for  all  intersections  of  the  1000th  row,  thus  solving  the  problem. 
The  direct  calculations  for  the  first  rows  are: 


7/0  91000  7/0  91000 

fliQ  —  0  =  _ _ —  2999  •  H11  =  0  =  - _ =  2999  * 


7/1  9999  7/1  1  7/1  9999  1  9999 

_  _ _ 0  _  ^ _  _  2998  .  H2i  =  12  °  ^  11  1  =  ^  Z  _  2^99  • 

7/1  9999  7/2  9998 

=  11  1  =  _ _  —  2998  .  H30  =  11  0  =  - _  =  2997  * 

7/2  1  7/2  9998  !  9999 

H\  =  1  =  - - tr=—  =  3-2887  ;  and  so  on  . 


2 


What  It  Means 
to  Solve 
a  Problem 


Thus,  the  problem  of  chapter  1  is  solved  . . . 

“How  is  it  solved?”  wonders  an  unconvinced  reader  (the  convinced 
reader  knows  in  advance  what  the  author  is  going  to  say,  and  nothing 
makes  him  wonder).  “  I  don’t  see  that  we  have  solved  it.” 

Author:  Well,  of  course  we  have  solved  it.  You  know  that  to  solve 
a  problem  means  to  find  its  solution.  And  we  have  just  found  the 
solution. 

Reader  ( indignantly ):  Is  this  really  a  solution? 

Author  ( pretending  that  he  doesn't  understand  what  the  trouble  is): 
Well,  is  it  really  incorrect? 

Reader:  No,  “it”  is  correct,  but  “it”  is  not  a  solution. 

Author:  But  then  what  is  a  solution? 

Reader:  A  line  of  numbers  showing  how  many  people  arrive  at  each 
intersection  of  the  thousandth  row. 

Author:  But  there  would  be  1001  numbers  in  this  line.  Is  it  possible 
that  the  organizers  of  the  Eighth  Olympiad  wanted  the  participants  to 
write  1001  numbers? 

Reader  becomes  thoughtful. 

Author:  I  have  a  proposal.  So  as  not  to  complicate  the  situation 
with  long  sequences,  let  us  select  one  intersection,  and  concern  ourselves 
with  the  number  of  people  who  arrive  there.  All  right? 

Reader  agrees. 

Author:  Now  what  will  we  consider  to  be  a  solution  to  the  following 
problem:  How  many  people  arrive  at  the  third  intersection  of  the 
fourth  row? 

Reader:  Huh?  A  number. 

Author:  Written  how? 

Reader  {amazed):  Well,  in  the  decimal  system. 

Author:  But  isn’t  an  answer  like  “i/43”  a  solution? 
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Reader:  Of  course  not.  Some  solution! 

Author:  By  continuing  the  series  of  calculations  which  we  began 
at  the  end  of  the  preceding  section,  it  is  easy  to  verify  that  2998  people 
visit  the  third  intersection  of  the  fourth  row.  Will  the  answer  “2998” 
be  a  solution  to  the  problem  ? 

Reader  (still  not  seeing  the  trap):  Yes,  of  course. 

Author:  But  you  know  that  the  expression  “2988”  is  not  an  expres¬ 
sion  in  the  decimal  system.  This  expression  consists  of  two  decimal 
numbers,  “2”  and  “998,”  whose  relative  position  shows  what  operation 
must  be  performed  upon  them  in  order  to  obtain  the  desired  number. 

Reader:  But  the  expression  “2998”  can  easily  be  converted  into 
decimal  form. 

Author:  Not  so  easily;  just  try  to  raise  2  to  the  998th  power.  But 
that  isn’t  even  the  trouble;  the  trouble  is  that  just  now  you  contradicted 
your  previous  statement.  Earlier,  you  agreed  to  consider  only  a  number 
written  in  the  decimal  system  as  a  solution.  From  the  point  of  view  of 
this  definition,  the  expression  “2998”  is  still  not  a  solution  (it  is  a  so- 
called  half-finished  product  from  which  the  solution  may  be  derived). 
Of  course,  such  a  point  of  view  is  acceptable  only  if  it  is  held  con¬ 
sistently.  But  another  point  of  view  is  possible,  according  to  which  2998 
is  a  solution.  Such  a  point  of  view  will  probably  clarify  the  matter  for 
you.  You  know  that  often  the  simplest  answers  to  mathematical 
problems  come  not  directly  in  the  form  of  a  number  written  down  in  the 
decimal  system,  but  in  related  “indirect”  form.  With  this  in  mind,  what 
should  we  settle  for  as  a  “solution,”  in  our  example,  to  the  problem  of 
how  many  people  visit  the  third  intersection  of  the  fourth  row? 

Reader:  In  our  example,  we  must  accept  as  a  solution  any  expression 
for  which  there  is  a  method  which  lets  us  get  a  numerical  answer 
(written  in  the  decimal  system)  from  that  expression.  That  is,  2998  will 
be  a  solution.  Although  the  method  of  getting  a  decimal  answer  (997 
consecutive  multiplications)  is  long,  it  is  feasible  in  principle. 

Author:  But  then  why  isn’t  7/43  a  solution?  Here,  too,  there  is  a 
method  of  getting  a  decimal  answer.  It  is  given  by  the  relations  (1.1)— 
(1.4). 

Reader  is  perplexed. 

Author  (satisfied  that  he  has  succeeded  in  leading  the  reader  into  a 
blind  alley — the  inexperienced  reader,  that  is:  the  experienced  reader  will 
himself  lead  the  author  into  a  blind  alley):  The  point  is  that  there  are  at 
least  three  interpretations  of  what  we  mean  by  a  solution  to  the  problem 
of  the  number  of  people  who  visit  a  given  intersection. 

First  interpretation:  By  a  solution,  we  mean  a  number  written  in  the 
decimal  system. 
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Second  interpretation:  By  a  solution,  we  mean  some  expression  which 
designates  a  number,  and  for  which  a  method  is  known  that  allows  us 
to  get  the  designated  decimal  number  (a  so-called  first-interpretation 
solution)  from  the  expression. 

Third  interpretation:  By  a  solution,  we  mean  some  expression  which 
designates  a  number  (written  in  the  decimal  system)  and  which  is  made 
up  of  numbers  and  some  operations  considered  “standard”  (for 
instance,  the  usual  arithmetic  operations).1  We  require  that  each 
standard  operation  be  accompanied  by  a  method  of  getting  the  decimal 
result  from  the  decimal  numbers  to  which  it  is  applied  (as  is  the  case 
with  the  arithmetic  operations).  Then  for  each  expression  which  is 
allowed,  there  will  exist  a  method  allowing  us  to  get  the  decimal 
number  designated  from  the  decimal  numbers  which  are  part  of  the 
expression,  so  that  a  solution  under  the  third  interpretation  will  auto¬ 
matically  be  a  solution  under  the  second. 

Under  the  first  interpretation,  neither  Hi3  nor  2988  is  a  solution  to 
the  problem  of  finding  the  number  of  people  arriving  at  the  third  inter¬ 
section  of  the  fourth  row.  To  get  a  solution,  we  must  find  a  decimal 
expression  for  2998;  however,  this  expression  would  consist  of  more 
than  300  digits,  and  could  be  calculated  in  a  reasonable  amount  of  time 
only  by  a  computer. 

Under  the  second  interpretation,  both  H*3  and  2998  are  solutions. 

In  the  case  of  the  third  interpretation,  everything  depends  upon  the 
choice  of  the  initial  standard  operations:  If  exponentiation  is  included 
among  them,  then  2988  will  be  a  solution;  if  it  is  not  included,  then  2988 
will  not  be  a  solution.  If  the  standard  operations  include  the  operation 
H  which  calculates  the  number  Hnk  from  the  numbers  n  and  k  (note 
that  the  relations  (1 . 1)— (1 .4)  give  a  method  of  performing  such  a 
calculation,  so  that  the  requirement  we  imposed  on  standard  operations 
is  satisfied),  then  Hi3  will  be  a  solution  to  the  problem;  in  the  opposite 
case,  it  will  not. 

The  question  of  whether  we  may  choose  the  standard  operations 
arbitrarily  naturally  arises.  Speaking  formally,  we  may  certainly  do  so. 
In  practice,  of  course,  we  should  choose  as  standard  operations  (through 
which  we  are  required  to  express  the  solution  of  any  problem)  such 
operations  as  are  encountered  in  the  solutions  of  many  problems,  or 
at  least  in  the  solutions  of  important  ones.  Such  operations  include 
the  four  arithmetic  operations  and  several  other  operations,  such  as 

1.  The  set  of  standard  operations  must  be  indicated  beforehand.  It  is  important 
to  emphasize  that  the  third  interpretation  depends  on  the  choice  of  this  set.  Thus, 
the  expression  2998  will  be  a  solution  under  the  third  interpretation  precisely  if  the 
operation  of  exponentiation  is  included  as  one  of  the  standard  operations. 
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exponentiation  and  the  operation  of  taking  factorials  (see  below,  chapter 
6).  If  the  operation  H  were  needed  for  important  problems  or  if  our 
own  problem  about  intersections  were  very  important,  then  perhaps  the 
operation  H  would  deserve  to  be  ranked  with  the  standard  operations. 
However,  the  operation  H  was  undeserving  before  we  introduced  our 
problem,  and  is  scarcely  worthy  now.  In  section  4  we  will  examine  an 
operation  similar  to  H  which,  as  we  shall  see,  deserves  to  be  included 
as  a  standard  operation. 

But  now  we  must  return  to  our  original  problem  of  the  intersections 
of  the  thousandth  row.  Its  solution  may  be  sought  in  three  forms, 
corresponding  to  the  three  interpretations  of  a  “solution”  described 
above. 

1.  In  the  form  of  a  sequence  of  1001  numbers,  written  in  the  decimal 
system.  We  shall  not  seek  such  a  solution  (since  we  found  it  too  difficult 
even  to  find  such  a  solution  for  one  intersection  of  the  fourth  row). 

2.  In  the  form  of  an  expression  which  in  principle  allows  us  to  calcu¬ 
late  the  number  (that  is,  to  find  a  decimal  representation  of  the  number) 
of  people  who  arrive  at  each  intersection  of  the  thousandth  row.  We 
have  already  found  such  a  solution:  H1000k,  for  which  the  process  of 
calculation  is  given  by  the  relations  (1.1)— (1.4). 

3.  In  the  form  of  an  expression  which  not  only  allows  us  to  calculate 
H1000k  for  any  k  from  0  to  1000,  but  which  is  formed  by  means  of 
certain  standard  operations.  It  is  in  this  form  that  we  shall  seek  a 
solution.  In  the  following  exposition  it  will  become  clear  precisely  which 
operations  will  be  considered  standard  for  our  purposes. 
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Pascal’s 

Triangle 


Let  us  consider  a  sequence  of  numbers  d0,  du  . . dn,  for  some 
n  =  0,1,2,...  (for  n  =  0  this  sequence  “degenerates”  to  the  sequence 
consisting  of  the  single  number  d0 ).  From  it,  let  us  generate  a  new 
sequence  of  numbers  s0,  su  . . .,  sn+1,  by  the  following  rule: 

s0  =  d0  ,  (3.1) 

sk  =  4-i  +  4  (1  <  k  <  n) ,  (3.2) 

■Sfi+1  =  4  •  (3.3) 

We  say  that  this  new  sequence  is  derived  from  the  original  one  by 
Pascal's  relations.  For  example,  from  the  sequence  2, 0,  —  2,  we  may 
produce  (using  Pascal’s  relations)  the  new  sequence  2,  2,  —  2,  —  2,  and 
from  this  one  in  turn  the  sequence  2,  4,  0,  -4,-2. 

The  French  mathematician  and  philosopher  Blaise  Pascal  (1623-62) 
investigated  the  properties  of  a  triangular  table  of  numbers,  each  row 
of  which  is  derived  from  the  preceding  by  the  relations  (3. 1)— (3.3).  This 
table,  which  we  shall  examine  further,  is  now  known  as  “Pascal’s 
triangle.” 

Remark  1.  If  the  sequence  jS  is  derived  from  the  sequence  a  by 
Pascal’s  relations,  then  the  sum  of  the  elements  of  sequence  f3  is  equal  to 
twice  the  sum  of  the  elements  of  sequence  a.  For  using  the  relations 
(3.1)— (3.3), 


+  Jx  +  H - b  +  J„+i 

—  4  +  (4  +  4)  +  (4  +  4)  h - p  (4-i  +  4)  +  4 

=  (4  +  4)  +  (4  +  4)  3 - P  (4  +  4) 

=  2(4  +  4  h - p  4)  • 
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(3.4) 
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Remark  2.  We  say  that  a  sequence  of  numbers  d0, . . . ,  dn  is  symmetric 
if  for  every  whole  number  k  from  0  to  n, 

dk  =  dn_k.  (3.5) 

For  example,  the  four-element  sequence  1,  0,  0,  1  is  symmetric. 

A  sequence  s0, . . . ,  sn+l  which  is  derived  from  a  symmetric  sequence 
d0, . . .,  dn  by  Pascal’s  relations  is  itself  symmetric.  To  establish  this,  we 
must  verify  the  relations 


sk  ~  s(n+l)-k  (3-6) 

for  k  =  0,  1,  2, . . .,  n  +  1.  But  for  k  —  0  and  k  =  n  +  1,  the  equation 
(3.6)  follows  from  the  relations  (3.1)  and  (3.3)  and  the  condition 
do  =  dn  (which  we  get  from  (3.5)  for  k  =  0).  For  1  <  k  <  n,  we  have: 

sk  =  *4-1  +  *4  =  *4-(ft-l)  +  *4-fc  =  *4n+  l)-k  +  *4n+l)-fc]-l 
=  *A(n+l)-k]-l  +  *An+l)-fc  =  ^(n+l)-fc-  (3-7) 

In  the  case  of  our  example,  application  of  Pascal’s  relations  to  the 
sequence  1,  0,  0,  1  yields  the  five-element  sequence  1,  1,  0,  1,  1  which 
is  itself  symmetric. 

Let  us  now  look  at  the  sequence  consisting  of  the  single  number  1 . 
We  shall  call  this  sequence  Pascal's  zeroth  sequence.  From  it,  we  may 
use  Pascal’s  relations  to  generate  a  new  sequence,  which  we  shall  call 
Pascal's  first  sequence.  Applying  Pascal’s  relations,  we  may  then 
generate  Pascal's  second  sequence  from  Pascal’s  first  sequence,  and  so  on. 
Since  in  each  transition  to  a  new  sequence,  the  number  of  sequence 
elements  is  increased  by  one,  there  will  be  n  +  1  numbers  in  Pascal’s  nth 
sequence.  Without  carrying  out  any  calculations,  we  observe  that 
remarks  1  and  2  allow  us  to  conclude: 

1.  The  sum  of  the  numbers  in  Pascal’s  nth  sequence  is  2n  (since  in 
proceeding  from  one  sequence  to  the  next,  the  sum  of  the  numbers  is 
doubled,  and  the  sum  of  the  numbers  of  the  zeroth  sequence  is  2°  =  1). 

2.  All  of  Pascal’s  sequences  are  symmetric  (since  the  property  of 
symmetry  is  preserved  in  passing  from  one  sequence  to  the  next,  and  the 
zeroth  sequence  is  symmetric). 

Let  us  write  down  Pascal’s  sequences,  one  under  another,  so  that  each 
number  of  each  sequence  is  found  between  and  below  those  numbers  of 
the  preceding  row  from  which  it  is  calculated.  We  obtain  an  infinite 
table,  called  Pascal's  triangle,  which  fills  the  interior  of  an  angle;  any 
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Fig.  3.1 


segment  of  it,  composed  of  the  zeroth  through  nth  rows,  forms  a 
triangle.  A  segment  of  Pascal’s  triangle,  consisting  of  the  first  fifteen 
rows  (the  zeroth  through  the  fourteenth)  is  presented  in  figure  3.1. 

Pascal’s  triangle  is  symmetric  about  the  bisector  of  the  angle  whose 
interior  it  fills,  a  consequence  of  the  fact  that  each  of  its  rows  is  sym¬ 
metric.  The  numbers  in  it  also  satisfy  a  number  of  interesting  properties. 
For  example,  the  sum  of  the  squares  of  the  elements  of  any  row  is  equal 
to  some  element  of  the  vertical  column  along  the  bisector  of  the  angle. 
For  any  prime  number  p,  all  elements  of  the  pth  row,  except  the  first 
and  last,  are  divisible  by  p.1  (A  prime  number  is  a  positive  whole  number 
whose  only  positive  whole  number  divisors  are  itself  and  1.) 

It  is  clear  that  a  method  of  constructing  Pascal’s  triangle  may  be 
given  without  relying  on  the  notions  of  “Pascal’s  relations”  or  “Pascal’s 
sequences”:  Pascal’s  triangle  is  simply  an  infinite  numerical  table  in 
“  triangular  form  ”  in  which  each  entry  along  the  sides  is  1,  and  in  which 
each  of  the  other  entries  is  the  sum  of  the  two  entries  above  it  (to  the 
right  and  to  the  left).  The  triangle  first  appeared  in  Pascal’s  paper 
“Treatise  on  the  Arithmetic  Triangle,”  published  posthumously  in  1665. 
In  that  work  the  table  reproduced  in  figure  3.2  was  published,  in  which 
each  entry  is  the  sum  of  the  preceding  entry  in  the  same  horizontal  row 
and  the  preceding  entry  in  the  same  vertical  column.2 


1.  More  of  these  properties  are  described  on  pp.  36-40  and  50-53  of  the  book 
Problems  in  the  Theory  of  Numbers  by  E.  B.  Dynkin  and  V.  A.  Uspenskii  (Boston: 
D.  C.  Heath  and  Company,  1963). 

2.  See  B.  Pascal,  Oeuvres  completes,  vol.  3  (Paris:  Hachette  et  Cie,  1908),  p.  244. 


12 


Pascal’s  Triangle 


1 

— 

1 

1 

1 

1 

1 

1 

1 

1  ’ 

1 

1 

2 

3 

4 

5 

6 

7 

8 

9 

1 

3 

6 

10 

15 

21 

28 

36 

1 

4 

10 

20 

35 

56 

84 

1 

5 

15 

35 

70 

126 

1 

6 

21 

56 

126 

l 

7 

28 

84 

1 

8 

36 

1 

9 

1 

Fig.  3.2 


Thus,  what  we  call  “Pascal’s  triangle”  differs  from  the  “triangle” 
examined  by  Pascal  himself  by  a  rotation  through  45°. 

Pascal  investigated  in  detail  the  properties  and  applications  of  his 
“triangle”;  several  such  applications  will  be  examined  in  the  following 
section.  For  the  present,  we  shall  examine  three  of  the  “triangle’s” 
properties  which  were  first  noted  by  Pascal  himself.  For  this  purpose 
(and  only  at  this  point  in  our  exposition)  we  shall  consider  that  arrange¬ 
ment  of  the  triangle  in  the  plane  which  Pascal  employed,  and  we  shall 
speak  of  “  rows  ”  and  “  columns.” 

Property  1.  Each  number  A  in  the  table  is  the  sum  of  the  numbers  in 
the  preceding  row,  from  the  leftmost  to  the  number  which  stands 
directly  above  A  (see  fig.  3.3,  in  which  the  squares  containing  the 
summands  which  give  the  sum  A  are  shaded). 

Property  2.  Each  number  A  in  the  table  is  the  sum  of  the  numbers  in 
the  preceding  column,  from  the  topmost  to  the  number  standing 
directly  to  the  left  of  A  (fig.  3.4). 


Fig.  3.3 


Fig.  3.4 


Fig.  3.5 
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Property  3.  For  each  number  A  in  the  table,  —  1  is  equal  to  the 
sum  of  all  the  numbers  contained  in  the  rectangle  bounded  by  that 
column  and  that  row  whose  intersection  is  the  entry  A  (this  row  and 
column  is  not  included  in  the  rectangle;  see  fig.  3.5). 

The  proof  of  property  1  is  by  mathematical  induction,  a  convenient 
method  of  proof  for  assertions  about  the  nonnegative  integers  (whole 
numbers).  The  proof  of  such  an  assertion  for  all  nonnegative  integers  m 
involves  two  steps:  (1)  establishing  the  assertion  for  m  —  0;  (2)  a  proof 
that  the  validity  of  the  assertion  for  m  =  k  implies  its  validity  for 
m  =  k  +  1. 

Once  these  two  requirements  are  satisfied,  the  desired  assertion  is 
proved  for  all  nonnegative  integers  m,  for  since  the  truth  of  the  assertion 
for  m  =  0  is  established  in  the  first  step,  the  second  step  allows  us  to 
conclude  that  the  assertion  is  true  for  m  —  1 .  Applying  the  second  step 
again,  we  may  conclude  that  the  assertion  is  true  for  m  —  2,  and  so  on. 

Property  1  may  be  proved  by  mathematical  induction  as  follows: 
Number  the  rows  and  columns  (starting  from  the  top  and  left)  of 
the  triangle  pictured,  starting  with  zero.  Let  Anm  denote  the  entry  in  the 
nth  row  and  mth  column.  The  assertion  is  that  A"m  is  the  sum  of  the 
first  m  +  1  entries  of  the  (n  —  l)th  row,  or 

Anm  =  An-\  +  A'-\  +•••+  A*-'m.  (3.8) 

Step  1.  If  m  —  0,  equation  (3.8)  becomes 

An  _  An- 1 
Si.  q  —  Si.  o  • 

Since  An0  =  An~10  =  1,  the  assertion  holds  for  m  —  0. 

Step  2.  Assuming  (3.8)  for  m  =  k,  and  using  the  fact  that  each 
“interior”  entry  is  the  sum  of  the  entry  immediately  preceding  it  in  its 
column  and  the  entry  immediately  preceding  it  in  its  row,  we  obtain 

An  —An  1  An- 1 
A  k+ 1  —  A  k  +  A  fc+1 

=  A'~\  +  A'~\  +  •  •  •  +  A'-\  +  A»-\+1  (3.9) 

by  using  our  assumption  (“inductive  hypothesis”)  on  k,  that 

A\  =  An~\  +  An~\  +  ■■■+  An~\. 

Since  (3.9)  is  a  restatement  of  (3.8)  for  m  =  k  +  1,  we  have  shown  that 
the  truth  of  our  assertion  (property  1)  for  m  =  k  implies  its  truth  for 
m  =  k  +  1 .  This  completes  the  second  step  of  the  proof. 
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Property  2  may  be  proved  by  performing  an  induction  on  n  rather 
than  m;  property  3  follows  either  from  property  1  by  induction  on  n  or 
from  property  2  by  induction  on  m. 

More  than  a  century  before  Pascal’s  treatise,  however,  an  interesting 
table — not  in  “triangular”  but  in  “rectangular”  form — was  published 
in  the  General  Treatise  on  Number  and  Measure  (published  in  1556-60), 
which  also  appeared  after  the  death  of  its  author,  the  distinguished 
Italian  mathematician  Niccolo  Tartaglia  (1506-59).  Tartaglia’s  table 
had  the  form  shown  in  figure  3.6.3 
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Fig.  3.6 


Here  each  entry  in  the  zeroth  row  is  1 ;  in  each  of  the  remaining  rows 
the  leftmost  (zeroth)  entry  is  1 ,  and  each  succeeding  entry  is  formed  as 
the  sum  of  the  two  entries  directly  before  it  and  above  it.  The  table 
which  Tartaglia  introduced  is  called  (naturally)  “Tartaglia’s  rectangle.” 

The  elements  of  each  of  Pascal’s  sequences  are  usually  numbered  from 
left  to  right,  beginning  with  the  zeroth.  Thus,  the  second  place  in  the 
fifth  row  is  occupied  by  the  number  10.  The  number  occupying  the  Ath 
place  in  the  «th  row  will  be  denoted  by  Tnk  so  that,  for  example, 
T° o  =  1,  T5 2  =  10,  I"14 4  =  1001.  The  expression  Tnk  will  obviously  be 
defined  for  any  n  >  0  and  A  =  0,  1, . . . ,  n. 

Let  us  examine  the  infinite  sequence  formed  by  the  numbers  Tnk  for 
any  fixed  k  and  variable  n,  that  is,  the  sequence 

T\,  Tk+1k,  Tk+\,...,T\ .  (3.10) 

This  sequence  begins  with  Tkk  since  the  Ath  row  is  the  first  row  which 
has  a  Ath  entry.  Its  elements  are  the  numbers  in  Pascal’s  triangle  occur¬ 
ring  in  the  “Ath  line  from  the  left,  parallel  to  the  left  side,”  and  also, 
because  of  the  triangle’s  symmetry,  the  numbers  occurring  in  the  “Ath 

3.  See  G.  G.  Tseiten,  lstoriya  matematiki  v  XVI  i  XVII  vekakh  [The  history  of 
mathematics  in  the  sixteenth  and  seventeenth  centuries]  (Moscow- Leningrad: 
GONTI,  1938),  p.  116. 
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line  from  the  right,  parallel  to  the  right  side.”  In  Tartaglia’s  rectangle, 
these  numbers  fill  the  Arth  column  and  the  &th  row. 

For  k  =  0  we  get  the  sequence 

1,1,1,  1,1,1,... 

(the  zeroth  column  or  the  zeroth  row  in  Tartaglia’s  rectangle). 

For  k  —  1  we  get  the  sequence  of  natural  numbers 

1,2,  3,  4,5,6,... 

(the  first  row  or  first  column  of  Tartaglia’s  rectangle). 

For  k  =  2  we  get  the  sequence 

1.3,  6,  10,  15,21,... 

(the  second  row  or  second  column  of  Tartaglia’s  rectangle).  The  elements 
of  this  sequence  are  called  triangular  numbers ;  the  mth  triangular  num¬ 
ber  is  Tm  +  12,  so  that  1  is  the  first  triangular  number,  3  is  the  second 
triangular  number,  and  so  on.  This  name  is  a  result  of  the  fact  that  the 
mth  triangular  number  Tm  +  12  is  the  number  of  spheres  (or  other  identi¬ 
cal  objects)  which  can  be  packed  in  the  shape  of  an  equilateral  triangle 
whose  base  is  made  up  of  m  spheres  (see  fig.  3.7).  In  particular,  the  mth 
triangular  number  is  the  number  of  elements  contained  in  the  first  m 
rows  of  Pascal’s  triangle,  from  the  zeroth  to  the  (m  —  l)th. 

Letting  k  =  3,  we  get  the  sequence 

1.4,  10,20,  35,  56,... 

(the  third  row  or  third  column  of  Tartaglia’s  rectangle).  The  numbers  of 
this  sequence  are  called  pyramidal  numbers,  or  more  precisely,  tetra¬ 
hedral  numbers;  1  is  the  first  tetrahedral  number,  4  the  second,  10  the 


(i) 

o 


(3) 

o 

o  o 


(6) 

o 

o  o 
o  o  o 


Fig.  3.7 
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third,  and  so  on,  so  that  the  mth  tetrahedral  number  is  Tm+23.  The  mth 
tetrahedral  number  Tm+23  is  the  number  of  spheres  which  can  be  packed 
in  the  shape  of  a  tetrahedron  (triangular  pyramid)  with  an  equilateral 
triangular  base  of  side  m  (see  fig.  3.8).4 


(i) 

o 


(4)  (10) 


4.  Triangular  and  pyramidal  numbers  (which  are  special  cases  of  the  so-called 
figure  numbers)  were  of  interest  to  the  ancient  Greeks,  who  attributed  mystical 
properties  to  them.  Of  the  writings  preserved  today  in  which  these  numbers  are 
examined,  the  earliest  is  probably  Introduction  to  Arithmetic,  by  the  ancient  Greek 
mathematician  Nicomachus  of  Gerasa,  who  lived  around  a.d.  100.  See  D.  J. 
Struik,  A  Concise  History  of  Mathematics  (New  York:  Dover  Publications,  1967), 
p.  72;  B.  L.  van  der  Waerden,  Science  Awakening  (New  York:  Oxford  University 
Press,  1961,  pp.  98-100).  According  to  indirect  information,  however,  polygonal 
numbers  were  studied  considerably  earlier,  in  the  2d  century  b.c.,  and  even  earlier, 
in  the  5th  century  b.c.  by  the  famous  mathematician  Pythagoras  and  his  students, 
the  Pythagoreans  (see  pp.  46-47  of  the  above  book  by  D.  J.  Struik). 
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By  virtue  of  their  definition,  the  numbers  T\  are  subject  to  the 
following  relations: 

T°  o=l,  (4.1) 

Tn+10  =  Tn+1n+1  =  1  for  n  =  0,  1,  2, ... ,  (4.2) 

Tn+\  =  T\-r  +  T\  for  n  =  0,  1,  2, . . .;  A:  =  1,  2, . . .,  n  .  (4.3) 

The  numbers  T\  are  completely  determined  by  these  relations;  using 
the  equations  (4.  l)-(4.3),  we  may  construct  as  many  rows  of  Pascal’s 
triangle  as  we  wish. 

The  definition  of  Tnk  may  be  extended  in  a  natural  way  so  that  it 
makes  sense  for  any  nonnegative  integer  n  and  any  integer  k.  To  do  this, 
we  set  Tnk  —  0  for  n  >  0  and  for  k  such  that  0  >  k  or  k  >  n.  Thus, 
Tnk  =  0  for  all  pairs  (n,  k )  for  which  n  >  0,  k  <  0,  and  for  all  pairs 

( n ,  k)  for  which  n  >  0,  k  >  n.  The  relation  Tn+1k  =  7’nfc_1  +  Tnk  will 

then  be  satisfied  for  all  k  (and  not  only  for  k  from  1  to  n,  as  in  [4.3]),  and 
the  numbers  Tnk  will  be  completely  determined  by  the  following 
equations: 


T°  o  =  l,  (4.4) 

T\  =  0  for  k  *  0  ,  (4.5) 

Tn+1k  =  TVi  +  T\  for  all  n  >  0  and  all  k  .  (4.6) 

These  relations  permit  us  to  give  a  graphic  representation  of  the 
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generation  of  Pascal’s  triangle.  Let  us  consider  an  infinite  table  of 
zeroes,  arranged  in  staggered  rows,  as  shown  below: 

....  0  0  0  0  .... 

....  00000  .... 

....  0000  .... 

....  00000  .... 


It  is  clear  that  such  a  table  satisfies  Pascal’s  relations,  which  require  each 
number  to  be  the  sum  of  the  two  nearest  numbers  in  the  preceding  row.1 
We  now  imagine  that  one  of  the  zeroes  in  the  first  row  of  this  table  is 
replaced  by  a  one.  If  Pascal’s  relations  are  to  be  preserved,  then  the 
“perturbation”  will  “enlarge  to  an  angle”— just  like  a  wave  in  a  brook 
when  disturbed  by  a  stick — in  the  form  of  Pascal’s  triangle: 

....  00  1  00  .... 

....001  100.... 

....01210.... 

....  0  1  3  3  1  0  .... 


Given  arbitrary  n  and  k  (n  =  0,  1,  2, . . . ;  k  =  0,  1, . . .,  ri),  it  would 
be  possible  to  find  Tnk,  if  we  had  sufficient  time  and  patience,  by 
writing  out  Pascal’s  triangle  and  continuing  until  we  arrived  at  the  fcth 
number  of  the  nth  row.  Or  we  could  take  advantage  of  the  relations 
(4. 1)— (4.3)  which  permit  us  to  determine  Tnk  by  performing  a  finite 
number  of  additions. 

We  leave  it  as  a  problem  for  the  reader  to  determine  the  minimum 
number  of  additions  which  must  be  carried  out  to  calculate  Tnk,  using 
the  relations  (4.1)— (4.3),  for  given  n  and  k.  (Hint:  Try  to  take  advantage 
of  the  symmetry  of  Pascal’s  triangle.) 

1 .  We  will  consider  an  infinite  row 


.  .  S-3,  S-2,  S-i,  So,  Si,  Sz,  . . . 

to  be  derived  from  the  infinite  row 

•  ■  ■ ,  d-z,  d-x,  d0,  dx,  dz, . .  . 

by  Pascal’s  law,  when  sk  =  «4_i  +  dk  for  each  k.  The  definition  of  Pascal’s  law 
for  finite  rows  follows  from  this  if  each  finite  row 


Xo,  Xx,...,Xn 


is  identified  with  the  infinite  row 


. ...  0,  0,  0,  x0,  Xx, . . . ,  xn,  0,  0,  0 . 
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Let  us  agree  to  call  the  operation  of  calculating  Tnk  from  the  numbers 
k  and  n,  Pascal's  operation.  Pascal’s  operation  is  then  defined  for  any 
n  and  k  for  which  n  >  0,  0  <  k  <  n.2 3 

But  if  we  redefine  Tnk  according  to  relations  (4.4)-(4.6),  then  Pascal’s 
operation  will  be  defined  for  any  nonnegative  integer  n  and  any 
integer  k. 

With  the  help  of  Pascal’s  operation,  it  is  easy  to  write  down  the 
numbers  Hnk,  which  serve  as  a  solution  to  the  Olympiad  problem  of 
section  1.  To  find  these  numbers,  we  first  define  (for  m  =  0,  1, . . .,  1000; 
q  =  0, 


7m  _  _ 1 _  um  (A  7V5 

^  «  21000~m  9 

so  that 

H\  =  21000~mZmq .  (4.8) 

Then,  from  the  relations  (4.7)  and  (1.1),  we  get 

Z°o  =  2lLo-21000  =  1-  (4-9) 

In  the  relations  (1.2),  (1.4),  and  (1.3)  we  may  then  replace  the  number 
Hmg  by  its  expression  in  terms  of  Zm,  given  in  (4.8).  We  get,  from  (1.2), 

7IOOO-7177; 

21000~<rt  +  l)^n  +  lQ  _  ± _  0  > 

from  which  we  obtain 


Zn  +  10  =  Zn o  .  (4.10) 

In  exactly  the  same  way,  from  (1.4)  we  get 

71000-11771 

21000 -(71  +  +1  =  _ _  77  ^ 


2.  Pascal  himself  (proceeding  from  the  rectangular  arrangement  of  the  table, 
which  he  proposed — see  fig.  3.2,  p.  12,  above)  examined  a  different  operation  in 
his  treatise,  that  of  finding  the  number  standing  at  the  intersection  of  the  xth 
column  and  the  yth  row  (with  the  rows  and  columns  numbered  beginning  with  the 
first,  so  that  this  operation  is  defined  for  x  >  1,  y  >  1)  from  the  numbers  x  and  y. 
If  this  number  is  denoted  by  P(x,  y),  then,  as  one  may  easily  verify,  P(x,  y)  = 

+  from  which  T\  =  P(k  +  1,  n  —  k  +  1). 

3.  Since  2  to  the  zeroth  power  is  considered  to  be  equal  to  1,  for  m  =  1000, 
equation  (4.7)  takes  the  form  Z1000,  =  H1000,. 
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from  which 


z»+ 


1 

n+l 


=  zn 

—  **  n  • 


(4.11) 


Finally,  from  (1.3)  we  get 


21000-(n  +  l)Z"  +  lfc  — 


2iooo -n ■Znic  j  +  21000~n-Znk 


1  <  k  <  n. 


from  which 


Zn+\  =  ZBfc_x  +  Z\ ,  1  <  k  <  n.  (4.12) 

The  equations  (4.10)-(4.12)  show  that  each  sequence 

...  _  /7H+1  7n  +  l  \ 

“n+l  ~  0>  •  •  •>  *-•  n+l/  > 

where  n  =  0,  1, . . .,  999  is  obtained  from  the  preceding  sequence 
wn  =  <Zn0,...,ZBn> 

according  to  Pascal’s  relations.  Since,  as  is  clear  from  equation  (4.9), 
the  initial  sequence 


o>0  —  CZ°o> 


is  Pascal’s  zeroth  sequence,  then  the  sequence  following  it,  <olt  is  Pascal’s 
first  sequence;  the  sequence  cu2  is  Pascal’s  second  sequence;  and  so  on.  For 
each  m  from  0  to  1000, 4  the  sequence  com  is  Pascal’s  /nth  sequence,  and 

Zm,  =  T\  .  (4.13) 

Consequently,  by  (4.8),  for  each  m  =  0,  1, . . .,  1000  and  for  each 
q  =  0, 1,  ...,/n, 


jjm  _  21000 


In  particular,  for  m  =  1000, 


H100\ 


_  ylOOO 


(4.14) 


(4.15) 


4.  For  m  >  1000,  the  sequence  <om  is  not  defined. 
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Thus,  the  number  of  people  who  arrive  at  an  intersection  in  the  thou¬ 
sandth  row  is  simply  an  element  of  Pascal’s  thousandth  sequence!  If 
Pascal’s  operation  is  considered  to  be  standard,  then  equation  (4.15) 
gives  a  solution  to  the  problem  of  section  1  (in  the  third  form  discussed 
at  the  end  of  chapter  2).  In  the  following  two  chapters  we  shall  see  how 
two  important  problems  can  be  solved  with  the  aid  of  Pascal’s  operation. 


5 


Binomial 

Coefficients 


In  this  section  we  shall  find  expressions  for  the  so-called  binomial 
coefficients  by  using  Pascal’s  operation.  In  order  to  define  the  binomial 
coefficients,  we  take  the  binomial  1  +  x  and  raise  it  to  powers  0,  1,2, 
3, . . . ,  arranging  the  terms  of  the  resulting  polynomials  in  order  of 
ascending  powers  of  the  symbol  x.  We  get 

(1  +*)»  =  !,  (5.1) 

(1  +  x)1  =  1  +  x ,  (5.2) 

(1  +  xf  =  (1  +  *)(1  +  *)  =  1  +  2x  +  x2,  (5.3) 

(1  +  xf  =  (1  +  x)2(l  +  jc)  =  1  +  3x  +  3x2  +  *3,  (5.4) 

and  so  on. 

In  general,  for  any  nonnegative  integer  n, 

(1  +  x)n  =  a0  +  axx  +  a2x 2  -1 - b  a„xp ,  (5.5) 

where  a0,  au  . . .,  ap  are  constants.  If  you  wish,  you  can  easily  verify 
that  p  =  n  and  that  a0  =  ap  =  1 ;  however,  we  do  not  need  this  now. 
Somewhat  later  on,  we  will  obtain  this  result  as  a  consequence  of  a 
more  general  formula.  At  this  stage,  it  is  sufficient  for  us  to  know  that 
the  result  of  raising  the  binomial  1  +  x  to  the  power  n  (where  n  is  a 
nonnegative  integer)  may  be  written  as  a  polynomial  with  integral 
coefficients,  arranged  in  order  of  increasing  powers  of  the  letter  x,  as 
exhibited  in  the  relation  (5.5).  This  polynomial  is  called  the  binomial 
expansion  of  (l  +  x)n.  Of  course,  its  coefficients  (and  p  +  1,  the  number 
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of  them)  depend  on  n.  In  order  to  stress  this  dependence,  one  often 
makes  use  of  expressions  for  these  coefficients  in  which  n  appears.  Specifi¬ 
cally,  the  coefficient  of  xk  in  the  binomial  expansion  of  (1  +  x)n  will  be 

designated  by  j .  The  numbers  are  called  binomial  coefficients. 

The  relation  (5.5)  may  now  be  written  as 


(1  +  x)n  = 


(5.6) 


and  from  the  relations  (5. 1)— (5.4),  we  get 


We  see  that  for  the  exponents  n  —  0,  1,  2,  3,  the  rows  of  binomial 
coefficients  coincide  respectively  with  the  0th,  1st,  2d,  and  3d  rows  of 
Pascal’s  triangle.  We  shall  now  show  that  the  analogous  relations  hold 
for  each  n.  To  do  this,  we  shall  look  at  how  the  sequence  of  coefficients 
for  (x  +  l)n+1  is  derived  from  the  sequence  of  coefficients  for  (x  +  l)n, 
taking  advantage  of  the  formula 

(1  +  x)n+1  =  (1  +  x)n(l  +  x).  (5.7) 


Let  us  write  down  the  expansions  for  the  left  and  right  sides  of  this 
formula  in  ascending  powers  of  the  letter  x.  For  the  left  side,  formula 
(5.6)  gives  (by  substituting  n  +  1  for  ri) 


(1  +  x)n+1 


("jvrt'v 
+("*  ‘)**+'"+ 


+  •  •  ■ 


(5.8) 
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for  some  q.  By  virtue  of  the  same  formula  (5.6),  we  have  for  the  right 
side: 


Because  of  (5.7),  the  right  sides  of  (5.8)  and  (5.9)  are  equal.  Therefore, 
q  —  p  +  1 ;  equating  coefficients  for  identical  powers  of  the  letter  x, 
we  get 


PJVft)-  <5..o, 

("  t -  1 W*)  'f0  <‘ </■+>.  (5-11) 


The  relations  (5. 10)— (5.12)  show  that  the  sequence  of  coefficients  of 
the  binomial  expansion  of  (x  +  l)n+1  are  derived  from  the  sequence  of 
coefficients  of  the  binomial  expansion  of  (x  +  l)n  by  Pascal’s  law.  Since 
the  sequence  of  coefficients  of  the  binomial  expansion  of  (x  +  1)° 
coincides  with  Pascal’s  zeroth  sequence,  all  succeeding  sequences  of 
coefficients  also  coincide  with  the  corresponding  rows  of  Pascal’s 

/n\ 


triangle.  Therefore,  the  numbers 
with 


are  defined  only  for  k  =  0, 


=  T\ 


(5.13) 
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Remark.  The  question  “What  coefficients  do  x~3  and  x20  have  in  the 
binomial  expansion  of  (x  +  l)5?”  may  be  answered,  “The  coefficients 


are 


zero.”  Therefore,  the  expression  may  be  defined  in  a  natural 


manner  for  the  cases  k  <  0  and  k  >  n  by  setting 


0  in  these  cases. 


Then  the  equation  (5.13)  will  hold  true  for  all  nonnegative  n  and  all 
integers  k,  by  virtue  of  the  redefinition  of  the  symbol  Tnk  which  was 
made  in  the  preceding  chapter. 

Thus,  we  have  expressed  the  binomial  coefficients  in  terms  of  Pascal’s 
operation.  We  may  now  rewrite  equation  (5.6)  in  the  following  form: 


(1  +  x)n  =  Tn0  +  T\x  +  T\x2  +  ■■■  +  Tnkxk  +  •  •  •  +  T\xn.  (5.14) 


Formula  (5.14)  is  sometimes  called  Newton's  binomial formula,  or  simply 
Newton's  formula.1  Another  more  traditional  expression  of  this  formula 
will  be  presented  in  chapter  7. 

In  a  certain  sense,  this  section  has  provided  a  “solution”  to  the 
problem  of  finding  an  expression  for  the  binomial  coefficient 


Recalling  chapter  2,  we  know  that  we  have  more  than  one  criterion  for 
the  “solution”  of  a  problem.  For  example,  if  (under  the  second  inter¬ 
pretation)  a  solution  is  considered  to  be  an  expression  which  allows  us 

to  get  the  binomial  coefficient  from  n  and  k,  then  is  itself  a 


solution.  If  we  require  that  the  solution  express  in  terms  of  the 

numbers  n  and  k  and  certain  standard  operations  (as  in  the  third  inter¬ 
pretation),  our  concept  of  “solution”  will  depend  on  the  collection  of 
standard  operations  chosen.  If  Pascal’s  operation  is  considered  standard, 
then  (5.13)  is  a  solution  to  the  problem  of  finding  the  binomial  coeffi¬ 
cients  Another  solution  to  this  problem,  corresponding  to  a 
different  collection  of  standard  operations,  will  be  given  in  chapter  7. 


1.  Formula  (5.14)  was  known  long  before  Newton;  in  particular,  Tartaglia  had 
already  mentioned  it.  Newton’s  name  is  connected  with  the  formula  only  because 
he  pointed  out  a  method  of  generalizing  this  formula  to  the  case  of  an  arbitrary 
rational  (including  negative)  exponent  in  1676. 
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The  Number  of 
Subsets  of 
a  Given  Set 


In  mathematics,  any  collection  of  objects  is  called  a  set.  Thus, 

(a)  the  collection  of  all  pages  in  this  booklet, 

(b)  the  collection  of  all  integers, 

(c)  the  collection  of  all  even  numbers, 

(d)  the  collection  of  all  the  pencils  in  a  certain  box 
are  all  sets. 

If  some  object  and  some  set  are  given,  exactly  one  of  the  following 
two  statements  is  true : 

1 .  The  object  belongs  to  the  set. 

2.  The  object  does  not  belong  to  the  set. 

In  the  first  case,  the  object  is  called  an  element  of  the  set.  For  example, 
the  number  3  is  an  element  of  the  set  of  all  integers,  and  is  not  an 
element  of  the  set  of  all  even  numbers. 

It  may  happen  that  all  elements  of  some  set  A  are  elements  of  another 
set  B  (for  instance,  all  elements  of  the  set  of  all  even  numbers  are 
elements  of  the  set  of  all  integers).  In  such  a  case,  the  set  A  is  said  to  be 
contained  in,  or  a  subset  of,  the  set  B.  Obviously,  every  set  is  a  subset 
of  itself.  If  the  set  A  is  a  subset  of  the  set  B,  and  the  set  B  is  a  subset 
of  the  set  A,  then  A  and  B  consist  of  the  very  same  elements,  and  are 
equal. 

Sets  can  be  finite  (like  the  sets  in  examples  (a)  and  (d)  above),  or 
infinite  (like  the  sets  in  examples  (b)  and  (c)  above).  Finite  sets  (and  we 
will  study  only  such  sets  in  this  section)  are  the  subject  of  a  particular 
discipline  in  mathematics — that  of  combinatorial  analysis. 

A  particular  set  stands  out  among  finite  sets:  the  set  containing  no 
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elements,  the  so-called  empty  set.  Thus,  the  possibility  is  taken  into 
account  that  on  opening  the  box  in  example  (d),  we  discover  that  the 
set  of  all  pencils  contained  in  it  is  empty.  The  empty  set  is  considered 
to  be  a  subset  of  every  set. 

If  a  set  is  finite,  then  its  elements  may  be  numbered  to  find  how  many 
elements  are  in  the  set.  A  set  which  consists  of  n  elements  is  called  an 
n-element  set.  The  set  of  pages  of  this  book  is  a  44-element  set,  for 
example,  and  the  empty  set  is  a  zero-element  set. 

Example.  Let  us  examine  a  set  consisting  of  three  objects,  a  pencil, 
a  pen,  and  an  eraser,  and  determine  all  of  its  subsets.  There  is  exactly 
one  zero-element  subset,  the  empty  set.  There  are  exactly  three  one- 
element  subsets  (fig.  6.1). 


1 


Fig.  6.1 


There  are  exactly  three  two-element  subsets  (fig.  6.2). 


Fig.  6.2 
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Finally,  there  is  exactly  one  three-element  subset  (the  set  itself)  (fig.  6.3). 


Thus,  our  set  has  eight  subsets  in  all. 

Let  an  n-element  set  be  given ;  any  A-element  subset  of  it  is  called  a 
combination  of  the  n  given  elements  taken  A:  at  a  time.  It  is  obvious  that 
the  number  of  combinations  of  n  given  elements  taken  A  at  a  time  does 
not  depend  on  the  n  given  elements,  but  only  on  the  numbers  n  and  A. 
The  number  of  combinations  of  n  elements  taken  A  at  a  time  is  denoted 

C\. 

Put  differently,  Cnk  is  the  number  of  A-element  subsets  of  an  n-element 
set.  The  expression  Cnk  is  usually  considered  to  make  sense  for  n  —  0, 
1,  2, . . .,  and  0  <  A  <  n.1 

The  total  number  of  subsets  of  an  n-element  set  will  be  denoted  by 
Cn,  so  that 


C„  =  C"0+  C\  +•••+  C\.  (6.1) 

What  are  the  numbers  Cn  and  C\?  We  can  answer  this  question  in  a 
few  specific  cases  at  once.  From  the  example  just  investigated,  we  know 
that  C3  =  8,  C30  =  C3 a=l,  and  C\  =  C32  =  3. 

Furthermore,  we  can  verify  the  three  properties  listed  below. 

First  property  of  the  number  of  combinations: 

Cm  o  =  Cmm  =  1 .  (6.2) 

Proof  It  is  clear  that  any  /n-element  set  S  has  exactly  one  zero- 
element  subset  (the  empty  set)  and  one  /n-element  subset  (the  set  S 
itself). 

1 .  However,  the  definition  can  be  extended  to  make  sense  for  A  >  n  by  setting  it 
equal  to  zero  in  this  case  (since  for  A  >  n  no  A-element  subset  exists). 
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Without  actually  calculating  the  numbers  C\,  we  shall  now  establish 
two  more  properties  of  these  numbers.  The  proof  of  the  second  property 
is  a  helpful  exercise  in  the  mastery  of  the  concepts  put  forth  in  this 
section;  and  the  third  property,  together  with  the  first,  is  a  basis  for 
calculating  the  numbers  C\. 

Second  property  of  the  number  of  combinations: 

C\  =  C\.k.  (6.3) 

Proof  Let  us  consider  any  n-element  set  M.  We  must  show  that  the 
number  of  ^-element  subsets  of  M  is  equal  to  the  number  of  (n  —  k)- 
element  subsets  of  M.  Let  us  carry  out  the  following  construction 
mentally.  From  paper,  we  cut  out  as  many  squares  as  we  have  ^-element 
subsets  of  our  set  (that  is,  C\)  and  on  each  of  them,  we  write  out  one 
of  these  subsets,  so  that  each  ^-element  subset  will  be  listed  on  exactly 
one  square.  Let  us  also  cut  from  paper  Cnn_fc  circles,  writing  each 
(n  —  A)-element  subset  on  some  circle.  It  is  now  sufficient  for  us  to 
show  that  there  are  equal  numbers  of  circles  and  squares.  For  this 
purpose,  we  lay  all  the  squares  on  a  table,  and  on  each  of  them  we  place 
a  circle,  according  to  the  following  rule:  If  some  ^-element  subset  of 
the  set  M  is  listed  on  a  square,  we  place  on  this  square  the  circle  on 
which  is  listed  the  subset  of  the  set  M  consisting  of  the  remaining 
elements 


a  b  c 
d  e 


Fig.  6.4 


(for  the  case  of  a  five-element  set  M,  consisting  of  the  elements  a,  b,  c, 
d,  e,  several  squares  together  with  their  corresponding  circles  are  shown 
in  figure.  6.4).  It  is  evident  that  on  each  square  there  lies  precisely  one 
circle,  and  that  each  circle  will  be  placed  on  precisely  one  square, 
implying  that  there  are  exactly  as  many  circles  as  squares. 

Before  going  on  to  the  third  property,  let  us  prove  the  following 
lemma. 
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Lemma.  Let  us  choose  some  element  a  in  an  (n  +  1  )-element  set  S. 
The  number  of  k-element  subsets  of  this  set  which  contain  this  chosen 
element  is  equal  to  Cnfc_1. 

Proof  Again,  let  us  conduct  a  mental  experiment  with  circles  and 
squares.  We  cut  from  paper  as  many  squares  as  there  are  k-element 
subsets  containing  the  chosen  element,  and  on  each  of  these  we  list  one 
such  subset,  so  that  each  of  them  will  be  represented  once.  We  then  cut 
from  paper  C\_i  circles,  and  on  each  circle  we  list  one  of  the  (k  —  1)- 
element  subsets  of  the  n-element  set  of  all  unchosen  elements,  so  that 
all  such  subsets  will  be  depicted  (there  are  n  unchosen  elements,  and 
therefore  C"fc_1  such  subsets).  On  each  square  we  place  a  circle,  accord¬ 
ing  to  the  following  rule:  If  some  subset  A  is  depicted  on  a  square,  then 
on  that  square  must  be  placed  the  circle  listing  the  set  derived  from  A 
by  removing  the  chosen  element.  It  is  clear  that  on  each  square  there  lies 
exactly  one  circle,  and  that  each  circle  is  placed  on  exactly  one  square, 
implying  that  the  number  of  squares  and  the  number  of  circles  are  both 
equal  to  C\_1.  Since  we  cut  out  as  many  squares  as  there  are  k-element 
subsets  of  the  ( n  +  l)-element  set  containing  the  chosen  element,  the 
number  of  such  subsets  is  equal  to  Cnfc_x,  which  is  what  we  were 
required  to  prove. 

We  now  go  on  to  the  third  property  of  the  number  C\. 

Third  property  of  the  number  of  combinations: 

Cn+\  =  CVx  +  C\  ,  1  <  k  <  n.  (6.4) 

Proof.  Let  us  take  an  arbitrary  (n  +  l)-element  set  M  and  compile  all 
its  k-element  subsets.  From  M  we  choose  some  element  a.  We  denote  by 
X  the  number  of  k-element  subsets  of  the  set  M  which  contain  the 
element  a,  and  we  denote  by  Y  the  number  of  k-element  subsets  of  the 
set  M  which  do  not  contain  a.  Then 

Cn  +  1k  —  X  +  Y.  (6.5) 

But  by  the  lemma,  X  =  C\_  x.  Moreover,  Y  is  simply  the  number  of 
combinations  of  the  n  unchosen  elements  taken  k  at  a  time,  that  is,  Cnk. 
Therefore, 

Cn+1k  =  Cnk-!  +  Cnk,  (6.6) 

which  is  what  we  wanted  to  show. 

The  third  property,  together  with  the  first,  shows  that  the  sequence 

C‘+10,C*+11,...,C»+1n+i  (6-7) 
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is  derived  from  the  sequence 

C\,C\,...,C\  (6.8) 

by  Pascal’s  relations.  Since  for  n  =  0  the  sequence 

C°  o  (6.9) 

coincides  with  Pascal’s  zeroth  sequence,  we  know  that  for  any  arbitrary 
n  the  sequence  (6.8)  will  coincide  with  Pascal’s  nth  sequence,  and  thus, 

C\  =  T\.  (6.10) 

Thus,  we  have  a  means  of  calculating  the  number  of  k-element  subsets 
of  an  n-element  set,  that  is,  the  number  of  combinations  of  n  elements 
taken  k  at  a  time  (in  this  way,  formula  [6.10]  gives  a  solution  to  the 
“problem  of  the  number  of  combinations,”  under  the  condition  that 
Pascal’s  operation  is  considered  standard).2 

Finally,  the  relations  (6.1)  and  (6.10)  show  that  the  number  of  all 
subsets  of  an  /i-element  set  is  equal  to  the  sum  of  all  the  entries  in 
Pascal’s  nth  sequence.  As  we  know,  this  sum  is  equal  to  2".  Conse¬ 
quently, 


Cn  =  2n. 


(6.11) 


2.  The  reader  will  find  a  different  solution,  with  different  standard  operations, 
in  section  7. 
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In  chapter  4,  two  methods  of  calculating  the  number  Tnk  from  the 
numbers  n  and  k  were  pointed  out:  the  more  “mechanical”  method  of 
writing  out  Pascal’s  triangle  (which,  however,  leads  to  superfluous 
calculations),  and  the  method  more  economical  with  regard  to  the 
number  of  steps  (which,  however,  requires  a  certain  organization  of 
calculations),  consisting  of  repeated  application  of  the  relations  (4.1)- 
(4.3).  These  two  methods  are  very  similar,  for  in  both  cases  the  numbers 
T\  are  obtained  using  Pascal’s  relations.  However,  there  is  another 
method  of  finding  Tnk,  which  we  shall  now  discuss. 

First,  let  us  introduce  a  new  symbol.  We  set 

0!  =  1, 

and  for  each  whole  number  m,  we  define 

ml  —  (m  —  1)1  m  . 

Thus,  for  m  >  0, 

ml  =  1-2 . m  . 

The  expression  ml  is  read  “/n  factorial.” 

We  shall  now  express  Pascal’s  operation  in  terms  of  arithmetic  opera¬ 
tions  and  the  operation  of  taking  factorials.  For  this  purpose,  we 
examine  the  following  expression: 

ml 

ql  (m  -  q)l  ' 
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Let  us  denote  this  expression  by  Fm„.  It  is  clear  that  the  expression 
Fmq  makes  sense  for  m  >  0,  0  <  q  <  m.  We  notice  that 


0! 

0  0!  0! 


Furthermore, 


Fm  0  =  7^-.  =  1  • 


0!  ml 


Finally, 

F\ -x  +  F\  = 


n\ 


pm  j 

m  m\  0! 


z! 


(k  -  1)!(«  -  k  +  1)!  '  k\(n  -  k)\ 

n\  n\ 


(k  -  1)!  (n  -  k)\  (n  -  k  +  1)  1  (k  -  1)!  k(n  -  k)\ 


(k  -  l)!(n  -  k)\ 
n\ 


1 


n  —  k  +  1  +  k 
n  +  1 


(n  +  1)! 


(k  —  1)!  (n  —  k)!  k(n  —  k  +  1)  k\  (n  +  1  —  £)! 

=  Fn+1lc ,  1  <  k  <  n. 

Thus,  the  sequence 

F°  o 

is  Pascal’s  zeroth  sequence,  while  the  (n  +  l)th  sequence 

rn  +  1  I?n  + 1  J?n  +  1 

r  05  2  15  •  •  *5  2  n+1 

is  derived  from  the  nth  sequence 

Fnn  Fn->  Fn 

1  05  L  15  •  *  *5  1  n 

by  Pascal’s  relations.  Therefore,  for  any  m  =  0,  1,  2, . . the  sequence 

zrm  pm  pm 

coincides  with  Pascal’s  with  sequence,  and 


pm  _  pm 

1  Q  J  Q  • 
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9  q\  (m  -  q)\  ' 

We  have  thus  expressed  Pascal’s  operation  in  terms  of  the  operations 
of  taking  factorials,  subtraction,  multiplication,  and  division,  in  the 
sense  that  we  have  found  an  expression  for  Tmq  containing  only  m,  q, 
and  the  symbols  for  the  indicated  operations.  This  permits  us  to  calcu¬ 
late  Tmq  directly,  since  we  are  able  to  calculate  factorials,  differences, 
products,  and  quotients. 

Several  interesting  corollaries  follow  quickly  from  the  formula  for 
Tmq  just  calculated. 

Corollary  1.  Canceling  (m  —  q) !  in  the  numerator  and  denominator 
of  the  expression  for  Tmq,  we  get 

Tm  ml 

9  q\(m-q)\ 

_  m(m  -  1)  -[m  -  (q  -  l)](m  -  q) ! 

q\{m-q)\ 

_  m(m  —  1  )  ■  ■  [m  —  (q  —  1)] 

?! 

_  m{m  —  !)•  •  [m  —  (q  —  1)] 

?•(?-  1)-  ...  -1 

Corollary  2.  Let  m  >  1,  m  >  q  >  1.  The  product  of  the  q  factors 
m(m  —  1)-  •  [m  —  (q  —  1)]  is  always  divisible  by  the  product  of  the  q 
factors  1-2  -q. 

Specifically,  because  of  corollary  1,  the  ratio  of  these  products  is 
equal  to  Tmq,  a  whole  number. 

Corollary  3.  From  the  relation  (4.15),  we  get 

jjiooo  _  !Q00!  . 

9  ql  (1000  -  q)\ 

This  is  a  new  form  of  the  solution  to  the  problem  of  section  1. 

Corollary  4.  From  the  relation  (5.13),  we  get 

tn\  n\  n(n  —  l)--[«  —  (k  —  1)] 

U/  ~  k\(n  -  k)\  1-2-  ...  • k  ' 


The  Connection  with  Factorials 


35 


This  is  the  traditional  high-school  expression  for  the  binomial  co¬ 
efficient. 

Corollary  5.  From  the  relation  (5.14)  and  corollary  1,  we  conclude: 


(1  +  x)n  =  1  +  nx  +  -(”  2  [)x2  +  ■ 


,  n(n  -!)•••[«-  (k  -  1)] 

+  1-2-  ...  -k  X  +' 


'  +  xn . 


This  is  the  traditional  high-school  form  of  Newton’s  binomial  formula. 

Corollary  6.  The  relation  (6.10)  gives  the  traditional  high-school 
formula  for  the  number  of  combinations 


-  nl  -  (fc  ~  I)] 

k  k\  (n  —  k)\  1-2-  ...  k 


